Method and apparatus for enabling purchase of or information requests for objects in digital content

ABSTRACT

A client computing system renders digital content (such as, for example, a movie or TV program) for a user. Based on the user&#39;s interest, the user may select an object image in a video frame of a sequence of video frames of the digital content being rendered. The user input data indicating the object image selection may be sent to a service provider server coupled to the client computing system over a network via a secure communications channel. The server obtains information about an object corresponding to the user-selected object image. The server sends the object information to the client computing system for display to the user. The object may be an object in the physical world, such as a consumer good. The object information may include information enabling the user to purchase the object.

FIELD

The present disclosure generally relates to the field of computing system architectures for securely processing digital content. More particularly, an embodiment of the invention relates to enabling the purchase of, or information requests for, objects in digital content being processed by a computing system.

BACKGROUND

On open computing platforms, such as a personal computer (PC) system for example, when playing premium content (such as from a DVD, Blu-Ray, etc.), the digital rights management (DRM) processing and key management are typically performed in software by a media player application program. These schemes are not well protected and there have been instances of hacking, resulting in pirated content and loss of revenue to content owners. When content is played, even though the media decompression (such as H.264, MPEG-2, etc.) is done in hardware, the content is in the clear in system memory and can be stolen with software-based and/or hardware-based attacks. Due to these noted security weaknesses, only lower fidelity (such as standard definition (SD)) content or less valuable high definition (HD) content is typically distributed to open computing platforms. Improvements to the secure handling of digital content by open computing platforms (such as a PC, for example) are desired.

BRIEF DESCRIPTION OF THE DRAWINGS

The detailed description is provided with reference to the accompanying figures. The use of the same reference numbers in different figures indicates similar or identical items.

FIG. 1 is a diagram of a secure content processing pipeline according to an embodiment of the present invention.

FIG. 2 is a diagram of a service provider server and security services infrastructure according to an embodiment of the present invention.

FIG. 3 is a diagram of a client computing system according to an embodiment of the present invention.

FIG. 4 is a flow diagram of secure content processing according to an embodiment of the present invention.

FIG. 5 is a diagram of a secure content processing system according to an embodiment of the present invention.

FIG. 6 is a flow diagram of object information processing according to an embodiment of the present invention.

FIG. 7 is a diagram of object information processing components on a service provider server according to an embodiment of the present invention.

DETAILED DESCRIPTION

Embodiments of the present invention comprise a system architecture that provides a hardware-based root of trust (HW ROT) solution for supporting distribution and playback of premium digital content. In an embodiment, HW ROT for digital content and services is a solution where the basis of trust for security purposes is rooted in hardware and firmware mechanisms in a client computing system, rather than in software. From this root of trust, the client computing system constructs an entire media processing pipeline that is protected for content authorization and playback. In embodiments of the present invention, the security of the client computing system for content processing is not dependent on the operating system (OS), basic input/output system (BIOS), media player application, or other host software. In order to compromise the system, one will need to compromise the hardware and/or firmware mechanisms, as opposed to attacking the software running on top of the OS.

Embodiments of the present invention provide services for purchase of, or information requests for, objects shown in displayed images in digital content being processed by the client computing system.

In the following description, numerous specific details are set forth in order to provide a thorough understanding of various embodiments. However, various embodiments of the invention may be practiced without the specific details. In other instances, well-known methods, procedures, components, and circuits have not been described in detail so as not to obscure the particular embodiments of the invention. Further, various aspects of embodiments of the invention may be performed using various means, such as integrated semiconductor circuits (“hardware”), computer-readable instructions organized into one or more programs stored on a computer readable storage medium (“software”), or some combination of hardware and software. For the purposes of this disclosure reference to “logic” shall mean hardware, software (including for example micro-code that controls the operations of a processor), firmware, or some combination thereof.

Embodiments of the present invention protect content protection processing, key management processing, and content playback by using firmware and hardware in the CPU, chipset and integrated graphics/media engine of a client computing system 101 to perform these functions. Embodiments of the present invention provide end-to-end protection of the content as the content is processed by components within a computing system. FIG. 1 is a diagram of a secure content processing pipeline 100 according to an embodiment of the present invention. Content 102 may be accessible by a service provider (SP) server 104. Content 102 may be any digital information, such as audio, video, or audio/video data, images, text, books, magazines, games, or application programs. Service provider server 104 may include one or more servers for providing the content to a client computing system over any telecommunications channel (such as the Internet, cellular networks, wired or wireless networks, etc.). Content may be protected by any known content protection technology 106 (e.g., digital rights management (DRM) technology, cryptographic techniques, etc.) while stored in the SP server and during transfer to the client computing system 101. In one embodiment, the content may be protected by the Enhanced Privacy ID (EPID) signature verification protocol as discussed herein. In one embodiment, video data may be encrypted using the Advanced Encryption Standard (AES) cryptographic processing with CTR mode. The client computing system 101 may be a PC, laptop, netbook, tablet computer, handheld computer, smart phone, personal digital assistant (PDA), set top box, consumer electronics equipment, or any other computing device capable of receiving, storing and rendering content.

Within the client computing system, content protection processing 110 may be performed by a Security Processor 108. In one embodiment, the security processor may be within a chipset of the client computing system. In an embodiment, the chipset comprises a platform control hub (PCH). In another embodiment, the Security Processor may be within the CPU of the client computing system. In another embodiment having a system-on-chip (SOC) configuration, the Security Processor may be integral with other system components on a single chip. In one embodiment, the security processor comprises a Manageability Engine (ME). In other embodiments, other types of security processors may be used. The Security Processor is a subsystem implemented in hardware and firmware that interacts with other components of the client computing system. The Security Processor operates by loading firmware code from a protected flash memory region and executing the firmware code in protected memory. Since the content protection processing is performed in hardware and firmware within the Security Processor, protection of the content may be improved over software-based systems.

Cryptographic key information may be sent from the security processor over a protected chip to chip interconnect 112 to a component containing a central processing unit (CPU) and an integrated graphics (GFX)/media engine. In an embodiment, the protected chip to chip interconnect 112 comprises a secure Direct Media Interface (DMI) communications link to the CPU/GFX component. DMI comprises a chip-to-chip interconnect with two unidirectional lanes of concurrent data traffic, and isochronous transfer with improved quality of service. Data transferred over the DMI link may be protected by known cryptographic processing techniques. In an embodiment, the chip-to-chip secure link may be used for passing encrypted title keys over the DMI. Security is based on a shared secret between the PCH and the CPU. This shared secret may be established on each power cycle and can vary between families of products, generations and random groupings as needed to ensure protection and integrity of the shared secret. The DMI mechanism is independent of the OS, the BIOS, and software running on the CPU. The DMI mechanism may be used to create a trust relationship between the security processor (in the PCH) and the CPU.

The GFX engine 114 may include content protection processing to decrypt the content. The GFX engine also includes decoder logic 121 to process/decode the decrypted audio/video content and pass the audio/video content as media blocks to a graphics processing unit (GPU) within the GFX engine 114. The GPU includes security techniques, including using encoder logic 123, to protect the media blocks during processing in memory. GFX engine 114 also includes composition logic 125 to compose the image data to be shown on display 118. As the content is being handled within and between the security processor in the PCH and the GFX engine in the CPU/GFX component, the content may be protected by a hardware protected data path 116. In an embodiment, the hardware protected data path comprises a Protected Audio Video Path (PAVP) to maintain the security of the content. PAVP also supports an encrypted connection state between system components. By using the PAVP, the system may further protect the content during transfer between system components and within memory.

The interface between the GFX engine, the PCH, and the display 118 may be implemented by protected wired/wireless display links 120. In one embodiment, display data sent from the GFX engine via a memory through the PCH to the display may be protected by a High-Bandwidth Digital Content Protection (HDCP) content protection scheme. The HDCP specification provides a robust, cost-effective and transparent method for transmitting and receiving digital entertainment content to compliant digital displays. In an embodiment, the wired link may be implemented according to the HDCP Specification, Revision 2.0, available from Digital Content Protection, LLC, or subsequent revisions. HDCP may be employed to deter copying of the display data as the data travels over a DisplayPort, Digital Visual Interface (DVI), High-Definition Multimedia Interface (HDMI), Gigabit Video Interface (GVIF), or a Unified Display Interface (UDI) connection. The HDCP revision 2.0 specification addresses emerging usage models that let end users conveniently connect displays, devices and home theater systems via standard protocols and interfaces like TCP/IP, USB, Wi-Fi and WirelessHD. The HDCP revision 2.0 specification uses standards-based RSA public key and Advanced Encryption Standard (AES) 128-bit encryption for robust content protection. In an HDCP system, two or more HDCP devices are interconnected through an HDCP-protected interface. The audiovisual content protected by HDCP flows from the Upstream Content Control Function into the HDCP system at the most upstream HDCP Transmitter. From there, the HDCP content, encrypted by the HDCP system, flows through a tree-shaped topology of HDCP receivers over HDCP-protected interfaces.

The HDCP content protection mechanism includes three elements: 1) Authentication of HDCP receivers to their immediate upstream connection (to an HDCP transmitter). The authentication protocol is the mechanism through which the HDCP transmitter verifies that a given HDCP Receiver is licensed to receive HDCP. 2) Revocation of HDCP receivers that are determined by the DCP to be invalid. 3) HDCP encryption of audiovisual content over the HDCP-protected interfaces between HDCP transmitters and their downstream HDCP receivers. HDCP receivers may render the HDCP content in audio and visual form for human consumption. HDCP receivers may be HDCP repeaters that serve as downstream HDCP transmitters emitting the HDCP content further downstream to one or more additional HDCP receivers. In one embodiment, display data sent to the display 118 may be sent over a protected wireless display (WiDi) link 127 using 802.11n wireless local area network (WLAN) technology.

As can be seen from FIG. 1, in embodiments of the present invention, from the time the content is received from the service provider server 104 until the content is displayed on the display 118, no cryptographic key or content is available in unencrypted form to any software or unauthorized hardware running on the computing system. Further, memory protection for video data is offered over the whole chain across the decrypt, decode/encode, compose and display pipelines. This capability is offered at the full memory bandwidth without compromising overall system performance.

FIG. 2 is a diagram of a service provider server 104 and a security services component 202 according to an embodiment of the present invention. In an embodiment, security services component 202 may comprise one or more servers and/or components. In an embodiment, the security services component may be operated by the manufacturer of one or more components of the client computing system. The security services component provides capabilities for controlling client computing systems in the field. The security services component comprises a manufacturing component and a deployment component. The manufacturing component includes a certificate issuance component 218, a key generation (Key Gen) component 220, and a fuse programming (Fuse Prog) component 222. Certificate issuance 218 generates and issues public key certificates to each of the client computing platforms. Key Gen 220 is responsible for generating the private and public key pairs as needed for embedding into the client computing platforms. Fuse programming 222 is responsible for programming the fuses on the manufacturing floor with appropriate values in a robust and secure manner. These values would be used by the client computing platform to build up the trust anchors and key ladders inside the security processor.

The deployment component includes a certificate issuance component 204, a key generation (Key Gen) component 206, and a revocation manager 208. Certificate (Cert) issuance component 204 issues a digital certificate for the SP Server and Client components giving them the authorization to interact with such client systems for service deployment. Key generation (Key Gen) component 206 generates a cryptographic signing key pair, a root key pair, digital certificates, and group public keys, and signs the group public keys for each group. Revocation manager 208 determines identifiers and signatures of client computing systems to be added to a revocation list (RL), updates the RL, and distributes updated RLs.

The SP server 104 communicates over network 201 (such as the Internet) with the client computing system. The service provider server comprises a SP server application 212 and a SP server agent 210. The SP server application provides content browsing capabilities. The SP server agent 210 controls the sending of client specific messages, manages cryptographic keys and authorized user tokens, and maintains content delivery service status (for deployment purposes 212 and 210 could be physically different servers firewalled and separated). Content encrypter 214 accepts content 102 and encrypts the content for secure delivery to a client computing system. Content server 216 sends the encrypted content to the client. Key server 226 is responsible for provisioning the title keys to the client computing systems within an authenticated session. Server certificate 224 is used by the SP server agent to participate in mutual authentication and establishment of the authenticated session with the client computing systems. Communications links between SP server agent 210, key server 226, and content server 216 are protected by well accepted information security practices. The key server has the highest network and access protection to ensure only authorized parties are able to reach it and the keys managed by the key server are isolated and firewalled from attackers from outside network entities. The SP server agent or the key server has access to the private key associated with the server certificate 224. In an embodiment, this private key and all operations done with this private key are protected using a hardware security module (HSM) (not shown in FIG. 2) on the server.

In an embodiment, the cryptographic scheme used to authenticate the client computing system with the SP server comprises a cryptographic authentication protocol protecting a user's privacy based on the use of zero-knowledge proofs. In an embodiment, the cryptographic authentication protocol comprises the Enhanced Privacy ID (EPID) scheme, a Direct Anonymous Attestation (DAA) scheme with enhanced revocation capabilities. EPID mitigates the privacy issues of common Rivest, Shamir, Adleman (RSA)—public key infrastructure (PKI) security implementations where every individual is uniquely identified for each transaction. Instead, EPID provides the capability of remote attestation but only identifies the client computing system as having a component (such as a chipset) from a particular technology generation. EPID is a group signature scheme, where one group's public key corresponds to multiple private keys, and private keys generate a group signature which is verified by the group public key. EPID provides the security property of being anonymous and unlinkable—given two signatures, one cannot determine whether the signatures are generated from one or two private keys. EPID also provides the security property of being unforgeable—without a private key, one cannot create a valid signature.

Generally, setting up a secure communications channel with EPID may be accomplished as follows. A first party (such as the client computing system) sends an EPID certificate to a second party (such as the service provider server). Never knowing the identity of the first party and only knowing the first party is a computing platform with a trusted security processor, the second party authenticates the first party. The first party then authenticates the second party using the second party's public key certificate. Since the second party doesn't require privacy, the second party's public key certificate may not be an EPID certificate (but it could be). The parties may then enter into a Diffie-Hellman (DH) key exchange agreement.

Various suitable embodiments of DAA and EPID are described in the following co-patent applications, incorporated herein by reference: “An Apparatus and Method of Direct Anonymous Attestation from Bilinear Maps,” by Ernest F. Brickell and Jingtao Li, Ser. No. 11/778,804, filed Jul. 7, 2007; “An Apparatus and Method for a Direct Anonymous Attestation Scheme from Short-Group Signatures,” by Ernest F. Brickell and Jingtao Li, Ser. No. 12/208,989, filed Sep. 11, 2008; and “Direct Anonymous Attestation Scheme with Outsourcing Capability,” by Ernest F. Brickell and Jingtao Li, Ser. No. 12/286,303, filed Sep. 29, 2008. In other embodiments, other authentication and attestation schemes may also be used.

The client computing system comprises at least three main components—host software, chipset hardware/firmware, and the CPU/GFX/Media engines. It is assumed in embodiments of the present invention that the host software is untrusted. Even if the host software gets attacked, no secrets will be compromised. Host software is responsible for network connection to SP server 104 and downloading media from content server 216. Host software acts as a proxy between various SP servers and the chipset hardware/firmware. Host software sends encrypted content directly to the graphics hardware after the chipset hardware/firmware has completed title key unwrap and injection into the CPU/GFX component.

Chipset hardware/firmware is responsible for all protected processing, taking the role of the protected device for content protection processing. In an embodiment, the chipset hardware/firmware sends protected title keys to the graphics hardware using the DMI mechanism.

The CPU/GFX component is responsible for final stream decryption, decode and display. The GFX engine is a passive device, making no policy decisions. When asked, the GFX engine simply decrypts the content, then decodes the submitted video slices. In an embodiment, the GFX engine (with protected media encoders) re-encrypts the display content for HDCP output protection over HDMI and wireless (e.g., WiDi) displays.

A protected client computing system must be remotely identified by a service provider before sending highly sensitive information. The mechanism used to identify the platform must not violate user privacy. Embodiments of the present invention provide a protected mechanism for a service provider to verify over the network that the service provider server is communicating to a suitable client computing system and transfer title keys and other confidential material to that client computing system. In one embodiment, the protocol utilized to establish a protected session between the service provider server and the client computing system is EPID. EPID allows for a single public key to anonymously verify the signature generated by N-private keys in what is called an EPID group. To implement EPID, each chipset contains a unique private key blown into the platform control hub (PCH) fuses during silicon manufacturing. In an embodiment, the chipset manufacturer places 1,000,000 private keys in a single group and produces 400 groups for each chipset produced. In order to act as the EPID verifier, each service provider will be provisioned with these 400 public keys.

Once a protected EPID session has been established, the service provider server is free to exchange protected confidential information with the protected client computing system. For content streaming, protected title keys may be passed from an SP server to the security processor in the chipset. The security processor sends the protected title keys to the graphics and audio hardware. At this point, encrypted video and audio content can be directly sent from a content server 216 to client computing system graphics and audio hardware which decrypts, decodes, and displays the content. For downloading content, the security processor binds the title keys to the client computing system using a unique platform storage key (again burned into PCH fuses during manufacturing) and returns the bound keys to media player software. When playback is desired, the bound title keys are re-submitted to the security processor, which unbinds and sends them in a protected manner to the graphics and audio hardware.

FIG. 3 is a diagram of a client computing system 101 according to an embodiment of the present invention. A service provider (SP) player/media browser software application 302 may be included in the software stack to interface with the SP server 104 over a network 201 such as the Internet. The SP player/media browser 302 allows a user to browse content offerings of the service provider and to select content to be delivered from the SP server to the client computing system. The SP player/media browser provides user interface controls for the user to manage a content library and to control the selection, downloading, and playback of content. The SP player/media browser interacts with service agent 304. Service agent 304 comprises a software application provided by a service provider that is authorized to access the features of the client computing system supporting end-to-end content protection according to embodiments of the present invention. The service agent interfaces with various SP player/media browser application programming interfaces (APIs) (not shown in FIG. 2). Service agent 304 comprises a media player component 306. The media player provides the content player functionality (e.g., controlling playback).

SP client application 308 enables the SP player/media browser 302 and the service agent 304 to access content protection features on the client computing system's hardware and firmware and for relaying messages to the service provider server 104. In an embodiment, the SP client application comprises a host agent software development kit (SDK) including content protection APIs. In an embodiment, the SP client application communicates with the security processor 314 in the platform control hub (PCH) 312 of the chipset.

Audio driver 311 provides an interface between the media player and audio decrypt hardware 316. Similarly, graphics (GFX) driver 310 provides an interface between the media player and the GFX engine 320. In an embodiment, the PCH 312 comprises security processor 314, which executes firmware to provide content protection functionality, along with other well known system functions. In an embodiment, the security processor may be implemented by a Manageability Engine (ME). As content is handled by the PCH 312 and the GFX engine 320, the content may be protected at least in part by Protected Audio Video Path (PAVP) components 318, 322 in the PCH hardware/firmware and GFX engine hardware, respectively.

FIG. 4 is a flow diagram of secure content processing according to an embodiment of the present invention. At block 402, a user of the client computing system uses SP player/media browser 302 to browse, discover, and purchase content from one or more service providers. At block 404, mutual authentication of the SP Server 104 and the client computing platform 101 is performed. An authenticated session is established. Key blobs with usage rights for a given set of content are provisioned. The key blobs are bound to the client computing system to ensure that the system is both confidentiality and integrity protected as necessary.

The client computing system then gets the encrypted content at block 406 from content server 216 over network 201 (for streaming operations) or from local storage on the client computing system (for content previously purchased, downloaded, and stored). The system is prepared to work on video slices (e.g., sub-frame). As a result, the hardware can process the data as soon as the first slice of data is submitted.

At block 408, the user initiates playback of the selected content using the SP player/media browser 302. The key blob is submitted to the security processor 314 for unpacking and extracting of the title key. When that is done, the title key is loaded by the security processor into the graphics hardware 320 for decryption. The SP player/media browser submits the encrypted content to the media processing engine within GFX engine 320 at block 410. The GFX engine decrypts the content using the title keys and re-encrypts the content using a local protected key. Re-encrypted data may be stored in protected local or system memory. The re-encrypted content is subsequently obtained, decrypted, and decompressed at block 414. The decrypt is performed first. Once the data is decrypted, the data is decoded/decompressed. Once the data is decompressed, the data is re-encrypted and passed to the composition engine via the system memory. Once the composition is finished, the data is again protected and passed using system memory to the display engine. In an embodiment, each component along the way has the ability to decrypt, process and re-encrypt as necessary.

At block 416, the GFX engine re-encrypts the media content using HDCP technology (in an embodiment) and delivers the content to the display for viewing by the user. At each step of the process, the content is never in the clear where it is accessible by software or unauthorized hardware components running on the client computing system.

FIG. 5 is a diagram of a secure content processing system according to an embodiment of the present invention. A SP server 104 interacts over network 201 to client computing system 101. Client computing system comprises first 500 and second components 502. In an embodiment, the first component comprises a CPU and GFX component, and the second component comprises a platform control hub (PCH). In another embodiment, the first and second components may be combined into a single component in a system-on-a-chip (SOC) implementation. First component 500 includes a plurality of processor cores 504, and GFX engine 320. Processor cores 504 execute various components of host software (SW) 506 (as described in FIG. 3), a client certificate 508, fuses 521, and a shared secret 519. Host SW reads data, including encrypted content previously obtained from a SP server or tangible media (such as a DVD, Blu-Ray, or other storage technology), from hard disk drive (HDD)/solid state drive (SSD) 510. In an embodiment, Host SW comprises at least a SP player/media browser application 302, a service agent 304, and a SP client application 308.

GFX engine 320 comprises a plurality of components. Media encrypt/decrypt engine 520 comprises logic to encrypt and decrypt content. Media encode/decode engine 522 comprises logic to encode and decode content. GFX Composition (Comp) engine 524 comprises logic to construct display images. Display engine 526 comprises logic to pass the composed display images to the display. Display encrypt/decrypt engine 528 comprises logic to encrypt and decrypt display data prior to sending the display data to display 538 over protected link 527. Memory encrypt/decrypt engine 530 comprises logic to encrypt and decrypt data stored in protected intermediate surfaces 534 in memory 536. Memory 536 also includes logic to implement confidentiality and integrity protected memory operations 532.

Second component 502 comprises a plurality of components, some of which are not shown in order to simplify FIG. 5. Second component comprises a security processor 314. Security processor includes firmware and/or hardware logic to provide attestation, provisioning key management, and output control operations 516 for the client computing system. Security processor also includes fuses 517, shared secret 519, and trust anchors 518 for supporting a PKI such as verification keys and key hierarchy information. Fuses 521, 517 are programmed into the hardware of the first and second components during manufacturing of the chipset with key material for EPID use. The hardware root of trust is built up from the information programmed into the fuses on the manufacturing floor when the client computing system is manufactured. This ensures that each individual client computing system is unique, yet privacy protected. Shared secret 519 is hard-coded into the hardware of the first and second components during manufacturing of the chipset and CPU/GFX components. In an embodiment, the shared secret may be used in setting up the secure chip to chip communications channel over the DMI link 538.

Client computing system also includes a protected real time clock 513 for providing secure clock services, a display 538, and a non-volatile memory (NVM) 512. In an embodiment, the protected real-time clock may be seeded by a third party, and may be virtualized for multiple service providers. The NVM may be used to store the firmware image for the second component, as well as to store temporary data (such as integrity and state information) for security processor processing operations.

In an embodiment, a processing flow may be described as follows. SP player/media browser 302 presents a user interface to the user. The user goes to the service provider's web site to browse available content. The SP web site has an auto detection capability to determine if the user's client computing system has integrated within it the capability of authenticating with the SP server 104. If capable, the user is allowed to choose content. The content may be bought, rented, or subscribed to, or may be streamed. The user pays for the content. SP player/media browser 302 invokes security processor 316 to authenticate the client computing system 101 with the SP server 104. In an embodiment, the authentication uses EPID technology. The client computing system 101 is authenticated at least in part by having the SP server 104 verify the client computing system's certificate 508, perform a revocation check, and verify a certification path to a certificate authority (using the EPID protocol in one embodiment). When both the client computing system 101 and the SP server 104 are authenticated, a secure communications channel may be set up based on the EPID protocol in one embodiment. In an embodiment, once the secure communication channel is set up, a command set may be used for end to end content protection capabilities.

The SP Server 104 provisions an encrypted title key to the client computing system, with constraints on usage of the content (e.g., time). The SP server sends the encrypted title key over the secure channel to security processor 314. Security processor 314 decrypts the encrypted title key, using its own key hierarchy. Security processor 314 uses a storage key to re-encrypt the newly decrypted title key to form a key blob. The key blob is bound to the client computing system for a specified time period. Security processor 314 sends the key blob to SP player/media browser 302 running in the CPU core. SP player/media browser 302 stores the key blob in HDD/SSD 510. SP player/media browser 302 then downloads the user-selected encrypted content. In one embodiment, the downloaded encrypted content may be stored in the HDD/SSD 510.

When a user wants to play the content, the SP player/media browser 302 submits the key blob back to the security processor 314. The security processor verifies the signature of the key blob, and checks usage constraints such as time, for example. The security processor 314 sends the encrypted title key over the encrypted channel (e.g., DMI link 538) to the media encrypt/decrypt component 520 of the GFX engine 320. The security processor instructs the SP player/media browser that the GFX/media engine is ready to process the encrypted content. The SP player/media browser 302 reads the encrypted content from the HDD/SDD 510, or obtains the encrypted content from the SP server 104 over the network 201 (for a streaming application), and sends the encrypted content to the GFX engine slice by slice.

The GFX engine 320 processes the encrypted content in a slice by slice manner. For each slice, the SP player/media browser reads the slice headers in the clear. The rest of the slice is encrypted so that the SP player/media browser cannot access the content. The SP player/media browser keeps track of playback state information using an initialization vector. The media encrypt/decrypt engine 520 decrypts the content using the title key, after decrypting the encrypted title key received from the security processor. In one embodiment, the output data of the media encrypt/decrypt engine is still compressed according to the well-known H.264 encoding scheme. In other embodiments, other encoding schemes may be used. The media encode/decode engine 522 decodes each slice and then re-encrypts the slice using memory encrypt/decrypt 530. The re-encrypted content slice is stored in protected intermediate surfaces 534 in memory 536. GFX composition engine 524 controls the composition of the image to be displayed on the display, including the foreground and background images, windows, etc. The GFX composition engine obtains the re-encrypted content slices from protected intermediate surfaces 534 in memory 536 to generate the composed image. The GFX composition engine 524 sends the composed image data to the display engine 526.

The display engine uses display encrypt/decrypt engine 528 to decrypt the composed image from the encryption that was used to store the content slices in memory 536. The display engine 526 uses the display encrypt/decrypt engine to re-encrypt the composed image data according to the HDCP technology, in one embodiment. The encrypted composed image data is sent by the GFX engine 320 over the protected chip to chip data interface (e.g., DMI link) 538 to the second component 502, for transfer to the display 538 over protected display interface link 527.

In an embodiment, there can be any number of concurrent, independent content streams being processed by the client computing system. Each content stream has its own cryptographic context so as not to interfere with other streams. This also allows for the client computing system to ensure that any kind of attack or compromise on one stream does not affect the other content streams.

Embodiments of the present invention provide the capability for purchasing goods and/or services related to, and/or obtaining information for, objects represented in digital content being rendered by the client computing system. FIG. 6 is a flow diagram of object information processing according to an embodiment of the present invention. At block 602, SP player/media browser 302 displays digital content for the user. As the user hears and sees the content, the user may be interested in something in the content. In an embodiment, the digital content comprises video, such as a movie or a TV program, for example. For example, the user may see an image of at least one object being displayed in at least one video frame of the sequence of video frames for the content. Generally, the at least one object image may be any image representing a physical world object captured by a camera as at least a portion of the video frame. For example, the object image may be an image in the video frame of a person, a plant or animal, a vehicle, a household item, a building, a landscape feature, a consumer good, and so on. In another example, the object image may an icon, symbol, or other representative image added to the video frame to represent an audio portion of the digital content, such as a song being played. In another example, the object may not physically exist in the physical world and may be a computer-generated, hand-drawn, or animated image added to the video frame. The user may also be interested in multiple objects at a time while consuming the content. In one illustrative example, the user may be watching a movie where an actor is wearing a coat and driving a car down a famous street in a particular city, and the movie soundtrack at that time comprises a song. In this example, the user may be interested in getting information (including purchase information, where appropriate and available) about one or more of the following objects: the actor, the coat, the car, the street, the city, and the song.

In an embodiment, when the user is interested in getting information about an object of interest, or perhaps purchasing the object, the user selects the object image using a user input mechanism available to the client computing system. In embodiments of the present invention, the available user input mechanism may comprise any user input device now known or to be developed, such as a computer mouse, keyboard, remote control device, game controller, touch screen, detected user gesture, etc. In an embodiment, the user may pause the playback of the content in order to provide the user input on a selected object image, and then resume playback when the user input selection is made. In other embodiments, pausing playback of the content may not be performed.

At block 604, the SP player/media browser accepts the input selection from the user selecting the at least one object image being rendered in at least one video frame of a sequence of video frames on the display. In an embodiment, the user input data may comprise at least one of the elapsed time since the start of the display of the content and a location in the video frame. For example, the user may select an object image being displayed at a particular (x, y) location in the video frame occurring during playback at an elapsed time t. In an embodiment, the time may be specified as hours, minutes, seconds, and milliseconds since the start of the displayed content. In another embodiment, the time may be specified as a counter from the start of the display content (e.g., a frame count or other counter). In another embodiment, only the elapsed time may be indicated by the user. In yet another embodiment, only the location may be indicated by the user.

As the user consumes the content and makes one or more selections of object images, the SP player/media browser in one embodiment stores the user inputs at block 606. In one embodiment, the SP player/media browser securely communicates and stores the user inputs in a secure storage location in NVM 512 using security processor 314 and DMI link 538. This helps to provide privacy for the user about content consumed and associated information requests and purchase transactions. In an embodiment, each individual user input may be stored as a separate operation. In another embodiment, user inputs from consumption of a particular digital content title (or portion thereof) may be stored in a single storage operation at the end of content consumption processing. Processing of blocks 602, 604, and 606 may occur even when the client computing system 101 is not connected to network 201.

In an embodiment, once the content is consumed and the client computing system is connected to the network, the SP player/media browser uses service agent 304, SP client application 308, and security processor 314 to obtain the stored user inputs from NVM 512 and to securely transmit the user inputs to SP server application 212 on SP server 104 using methods discussed above. In another embodiment, if the client computing system is connected to the network while the content is being consumed, and if the user has selected he or she wants immediate feedback on information requests, the SP player/media browser may send each individual user input in real time to the SP server application on the service provider server for processing. Regardless of whether the user inputs are batched or handled individually, in one embodiment, the SP server application compares each user input to entries in a content object information database at block 610. In an embodiment, the comparison may involve attempting to match the time and location information from the user inputs to known information about the audio and video portions of the content.

FIG. 7 is a diagram of object information processing components on a service provider server according to an embodiment of the present invention. SP server application 212 securely receives one or more user inputs 702 from the client computing system over the network. The SP server application has access to content 102. In one embodiment, one or more components of the SP server, such as SP server application, for example, may be used to generate a content object information database 704 based on static analysis of the content. That is, the content may be analyzed for object images before distributing the content to any client computing system. The content object information database includes an entry for each object image in the content that is supported by the service provider for providing information to the user based on an information request. In an embodiment, a content object information database may be built by the service provider for each supported content title.

In an embodiment, prior to distribution of the content and/or supporting object information requests, service provider personnel analyze the audio and video portions of the content, identify object images, and create database entries for supported identified objects. In an embodiment, identified objects to be supported may be selected based on current advertising and/or marketing campaigns of the content provider, service provider, and/or another entity. In one embodiment, relevant information associated with supported objects may be obtained and stored in the content object information database. In another embodiment, links to the relevant information may be stored, and the information may be obtained as needed from any information source (e.g., the Internet, other data structures, other databases, etc.). In yet another embodiment, the service provider may dynamically change the information in the content object information database at any time for any selected supported object due to such things as changing market conditions, advertising campaign changes, current events such as weather, sports scores, news items, etc., time of day, time of year, and so on.

Information obtained from the content object information database 704 or obtained from other sources that is associated with the selected object may be identified as object information 710. For example, when the object is a consumer good, the object information may comprise one or more of the brand name, manufacturer, model number, title, price, availability, features, and so on. For example, when the object is a person, such as an actor or actress, the object information may include one or more of the person's biographical information, film resume, fan club information, link to a website, and so on. For example, when the object is a city or landmark, the object information may include information about visiting the city or landmark. For example, when the object image is representing a song, the object information may include one or more of song title, artist, album title, record company, and purchase information. Generally, object information comprises any information related to the object. In one embodiment, object information includes information about purchasing the object or a related good and/or service.

In an embodiment, the SP server application may securely maintain a user profile 706 for the user on the service provider server. The user profile comprises at least one of preferences and interests of the user. In an embodiment, the SP server application may select and/or tailor object information based at least in part on the user profile. In one embodiment, the user may enter or select his or her preferences and/or interests by affirmative action by the user using the SP player/media browser. In another embodiment, the SP server application may develop and maintain the user profile based at least in part on the content delivered to the user's client computing system and/or the user inputs received by the service provider server over time. In an embodiment, the preferences may include what types of information the user is interested in, and how, when and in what format the user wants to receive the information. In an embodiment, preferences may include purchase parameters, such as financial limits for purchases, purchase notifications and approvals, etc. Other preferences may also be specified. In an embodiment, interests may include things the user selects as interests (e.g., sports, hobbies, television, film, and music genres, etc.) as well as interests determined by analyzing past behavior, such as received user inputs and delivered and/or consumed content. In an embodiment, the user inputs may be stored as part of the user profile.

Returning back to FIG. 6, at block 612, when a match occurs when comparing the user input for a selected object (represented by an object image) and an entry in the content object information database, in one embodiment the SP server application obtains and stores information about the object corresponding to the selected object image into the user profile. In another embodiment, links to the relevant information may be stored in the user profile, and the information may be obtained as needed from any information source (e.g., the Internet, other data structures, other databases, etc.). At block 614, upon request, the SP server application on the service provider server sends the object information from the user profile (or obtains the object information from another source as needed) to the SP player/media on the client computing system. In one embodiment, this may occur when the user logs in to a web site operated by the service provider and requests one or more of the object information. At block 616, the SP player/media browser displays the requested object information to the user. In an embodiment, the object information may include purchase information allowing the user to complete a purchase transaction for the object. In an embodiment, blocks 608, 610, 612, and 614 may be performed in real time, without waiting for the user to log in to a website to request the object information. In an embodiment, the user may pause playback of the content to view the requested object information.

Referring back to FIG. 7, in one embodiment, dynamic analysis 708 may be performed by the SP server application on the content, instead of static analysis, resulting in stored content object information database entries. Dynamic analysis component 708 may analyze audio portions and video frames of content 102 based at least in part on received user input 702. The dynamic analysis component may use known audio recognition, image recognition, and/or facial recognition processing techniques to analyze one or more of the audio portions and video frames indicated by the user input to attempt to determine one or more selected object images or sounds in the content copy on the service provider server, as the content is being consumed on the client computing system. For example, if the content is being streamed to the client computing system, as the content is streamed over the network, the content on the server may be analyzed to determine objects in current content portions. The SP server application accepts the user inputs and associates them to the currently streamed content portions. When an object is identified by such dynamic analysis, then the SP server application obtains object information 710 that relates to the identified object, and continues with blocks 612, 614, and 616.

Embodiments of the present invention capture user interest in objects in a way that is more extensible and flexible than adding pre-determined hotspots into the video frames. In embodiments of the present invention, the service provider server may dynamically determine if there is anything interesting to the user in the content portions and take action independently, or take different actions for the same user input at different times.

Reference in the specification to “one embodiment” or “an embodiment” means that a particular feature, structure, or characteristic described in connection with the embodiment may be included in at least an implementation. The appearances of the phrase “in one embodiment” in various places in the specification may or may not be all referring to the same embodiment.

Also, in the description and claims, the terms “coupled” and “connected,” along with their derivatives, may be used. In some embodiments of the invention, “connected” may be used to indicate that two or more elements are in direct physical or electrical contact with each other. “Coupled” may mean that two or more elements are in direct physical or electrical contact. However, “coupled” may also mean that two or more elements may not be in direct contact with each other, but may still cooperate or interact with each other.

Thus, although embodiments of the invention have been described in language specific to structural features and/or methodological acts, it is to be understood that claimed subject matter may not be limited to the specific features or acts described. Rather, the specific features and acts are disclosed as sample forms of implementing the claimed subject matter. 

The invention claimed is:
 1. A method of processing digital content on a client computing system comprising: rendering digital content by the client computing system; displaying the digital content being rendered on a display of the client computing system; accepting user input data from a user selecting at least one object image within at least one video frame of a sequence of video frames of the digital content being displayed, wherein the user input data comprises: (i) a location of the selected at least one object image within the at least one video frame of the digital content being displayed and (ii) an elapsed time from a start of the rendering of the digital content being displayed; storing, by a security processor of the client computing system, the user input data in a secure storage of the client computing system; sending the user input data to a server coupled to the client computing system over a network via a secure communications channel; receiving information about an object corresponding to the selected object image from the server; and displaying the object information on the client computing system.
 2. The method of claim 1, wherein the selected at least one object image comprises an image of an object captured by a camera as at least a portion of at least one of the video frames.
 3. The method of claim 2, wherein the captured object comprises a physical world object.
 4. The method of claim 3, wherein the captured object comprises a consumer good.
 5. The method of claim 1, wherein the object information comprises information to enable the user to purchase the object.
 6. Amended) A client computing system for processing digital content comprising: a user input mechanism for entering user input data; a display; a processor adapted to: (i) render digital content; (ii) cause the display of the rendered digital content on the display; (iii) accept user input data from a user via the user input mechanism to select at least one object image within at least one video frame of a sequence of video frames of the digital content displayed on the display, wherein the user input data comprises: (a) a location of the selected at least one object image within the at least one video frame of the digital content displayed on the display and (b) an elapsed time from a start of the rendering of the digital content displayed on the display; (iv) send the user input data to a server coupled to the client computing system over a network via a secure communications channel; (v) receive information about an object corresponding to the selected object image from the server; and (vi) cause the display of the object information on the display; and a security processor adapted to store the user input data in a secure storage of the client computing system, wherein the security processor is different from the processor.
 7. The client computing system of claim 6, wherein the selected at least one object image comprises an image of an object captured by a camera as at least a portion of at least one of the video frames.
 8. The client computing system of claim 7, wherein the captured object comprises a physical world object.
 9. The client computing system of claim 8, wherein the captured object comprises a consumer good.
 10. The client computing system of claim 6, wherein the object information comprises information to enable the user to purchase the object.
 11. The client computing device of claim 6, wherein the security processor is further adapted to obtain the user input data from the secure storage.
 12. The client computing device of claim 6, wherein the processor is further adapted to: (i) cause the display of the rendered digital content to be paused, wherein to accept user input data from the user via the user input mechanism comprises to accept user input data from the user via the user input mechanism when the display of the rendered digital content is paused; and (ii) resume the display of the rendered digital content in response to accepting the user input data from the user via the user input mechanism.
 13. A computer-readable medium comprising one or more instructions that when executed by a client computing device cause the client computing device to: render digital content; display the rendered digital content on a display of the client computing device; accept user input data from a user selecting at least one object image within at least one video frame of a sequence of video frames of the digital content being displayed, wherein the user input data comprises: (i) a location of the selected at least one object image within the at least one video frame of the digital content being displayed and (ii) an elapsed time from a start of the rendering of the digital content being displayed; store, by a security processor of the client computing device, the user input data in a secure storage of the client computing device; send the user input data to a server coupled to the client computing device over a network via a secure communications channel; receive information about an object corresponding to the selected object image from the server; and display the object information on the display of the client computing device.
 14. The computer-readable medium of claim 13, wherein the selected at least one object image comprises an image of an object captured by a camera as at least a portion of at least one of the video frames.
 15. The computer-readable medium of claim 14, wherein the captured object comprises a physical world object.
 16. The computer-readable medium of claim 15, wherein the captured object comprises a consumer good.
 17. The computer-readable medium of claim 14, wherein the object information comprises information to enable the user to purchase the object.
 18. The computer readable media of claim 13, wherein the one or more instructions further cause the client computing device to obtain the user input data from the secure storage.
 19. A method of processing digital content comprising: rendering digital content by a first application on a client computing system; displaying the digital content being rendered on a display of the client computing system; accepting, by the first application, user input data from a user selecting at least one object image within at least one video frame of a sequence of video frames in the digital content being displayed, wherein the user input data comprises: (i) a location of the selected at least one object image within the at least one video frame of the digital content being displayed and (ii) an elapsed time from a start of the rendering of the digital content being displayed; storing, by a security processor of the client computing system, the user input data in a secure storage within the client computing system; sending the user input data to a second application on a server coupled to the client computing system over a network via a secure communications channel; comparing, by the second application, the user input data to entries in a content object information database on the server; when the user input data matches an entry in the content object information database, obtaining and storing, by the second application, information about an object corresponding to the selected object image; sending the object information, by the second application, to the first application over the secure communications channel; and displaying, by the first application, the object information on the client computing system.
 20. The method of claim 19, wherein the selected at least one object image comprises an image of an object captured by a camera as at least a portion of at least one of the video frames.
 21. The method of claim 20, wherein the captured object comprises a physical world object.
 22. The method of claim 21, wherein the captured object comprises a consumer good.
 23. The method of claim 21, wherein the captured object comprises a person.
 24. The method of claim 19, wherein the selected object image represents a song.
 25. The method of claim 19, wherein the object information comprises information to enable the user to purchase the object.
 26. The method of claim 19, wherein the content information database comprises an entry for each object image in the content that is supported for providing object information to the user.
 27. The method of claim 26, further comprising analyzing audio and video portions of the content, identifying object images, and creating entries in the content information database for identified object images prior to distribution of the content.
 28. The method of claim 26, further comprising analyzing audio and video portions of the content based at least in part on the user input data, identifying object images, and creating entries in the content information database for identified object images as the content is being consumed on the client computing system.
 29. The method of claim 26, further comprising dynamically changing the object information in the content information database.
 30. The method of claim 19, wherein storing, by the second application, information about an object corresponding to the selected object image comprises storing the object information in a user profile on the server, and tailoring object information to send to the client computing device based at least in part on the user profile.
 31. The method of claim 30, further comprising determining interests of the user based at least in part on content delivered to the client computing system and user inputs, and storing interests of the user in the user profile.
 32. A system comprising: a client computing system configured to: render digital content; display the rendered digital content on a display of the client computing system; accept user input data from a user selecting at least one object image within at least one video frame of a sequence of video frames in the digital content displayed on the display, wherein the user input data comprises: (i) a location of the selected at least one object image within the at least one video frame of the digital content displayed on the display and (ii) an elapsed time from a start of the rendering of the digital content displayed on the display; and store the user input data in a secure storage within the client computing system; and a server, coupled to the client computing system over a network via a secure communications channel, configured to: receive the user input data from the client computing system; compare the user input data to entries in a content object information database on the server; obtain and store information about an object corresponding to the selected object image in response to the user input data matching an entry in the content object information database; and send the object information to the client computing system over the secure communications channel; and wherein the client computing system is further configured to display the object information to the user.
 33. The system of claim 32, wherein the selected at least one object image comprises an image of an object captured by a camera as at least a portion of at least one of the video frames.
 34. The system of claim 33, wherein the captured object comprises a physical world object.
 35. The system of claim 32, wherein the object information comprises information to enable the user to purchase the object. 